BIRC5 expression by race, age and clinical factors in breast cancer patients

Purpose Survivin/BIRC5 is a proliferation marker that is associated with poor prognosis in breast cancer and an attractive therapeutic target. However, BIRC5 has not been well studied among racially diverse populations where aggressive breast cancers are prevalent. Experimental design We studied BIRC5 expression in association with clinical and demographic variables and as a predictor of recurrence in 2174 participants in the Carolina Breast Cancer Study (CBCS), a population-based study that oversampled Black (n = 1113) and younger (< 50 years; n = 1137) participants with breast cancer. For comparison, similar analyses were conducted in The Cancer Genome Atlas [TCGA N = 1094, Black (n = 183), younger (n = 295)]. BIRC5 was evaluated as a continuous and categorical variable (highest quartile vs. lower three quartiles). Results Univariate, continuous BIRC5 expression was higher in breast tumors from Black women relative to non-Black women in both estrogen receptor (ER)-positive and ER-negative tumors and in analyses stratified by stage (i.e., within Stage I, Stage II, and Stage III/IV tumors). Within CBCS and TCGA, BIRC5-high was associated with young age (< 50 years) and Black race, as well as hormone receptor-negative tumors, non-Luminal A PAM50 subtypes, advanced stage, and larger tumors (> 2 cm). Relative to BIRC5-low, BIRC5-high tumors were associated with poor 5-year recurrence-free survival (RFS) among ER-positive tumors, both in unadjusted models [HR (95% CI): 2.7 (1.6, 4.6)] and after adjustment for age and stage [Adjusted HR (95% CI): 1.87 (1.07, 3.25)]. However, this relationship was not observed among ER-negative tumors [Crude HR (95% CI): 0.7 (0.39, 1.2); Adjusted HR (95% CI): 0.67 (0.37, 1.2)]. Conclusion Black and younger women with breast cancer have a higher burden of BIRC5-high tumors than older and non-Black women. Emerging anti-survivin treatment strategies may be an important future direction for equitable breast cancer outcomes. 
Supplementary Information The online version contains supplementary material available at 10.1186/s13058-024-01792-y.


Introduction
Black women suffer 41% higher breast cancer mortality compared to White women [1]. Differences in tumor biology at diagnosis (either due to differential risk factors or differences in detection) may contribute to this underlying disparity [2][3][4][5][6][7][8]. While research and treatment advances have significantly lowered breast cancer mortality rates over the years, declines in mortality among Black women continue to lag behind [1]. Therefore, there is a vital need for novel targets for therapeutic response in diverse breast cancer patients. Survivin is a protein in the inhibitor of apoptosis protein family encoded by the BIRC5 gene, and its mechanisms of action include inhibition of apoptosis, dysregulation of mitosis, cell cycle progression, carcinogenesis, and DNA repair [9].
Survivin is a marker of poor prognosis [10,11] and is commonly associated with enhanced proliferative index [12], reduced levels of apoptosis [13], resistance to chemotherapy [14][15][16], and increased rate of tumor recurrence [17] across multiple tumor types, including breast cancer. Survivin/BIRC5 is already included as a proliferation marker in two clinically utilized RNA-based prognostic assays in breast cancer, the Oncotype DX assay [18] and the Prosigna assay [19]. Prior studies have shown that high survivin expression is associated with estrogen receptor (ER)-negative [20,21], high grade, and lymph node-positive breast tumors [22,23]. However, most studies investigating survivin have been conducted in smaller cohorts of predominantly White women or did not report on race [10,11,20-24], and little is known about survivin in tumors from young and Black breast cancer patients, who are more frequently diagnosed with advanced disease, higher grade, and aggressive molecular subtypes [25,26]. Currently, there are various methods of targeting survivin therapeutically, including small molecule inhibitors that block the function of survivin, interference with survivin gene expression, or survivin-based immunotherapy [27], making this a promising candidate for addressing disparities in outcomes.
Given that survivin/BIRC5 may be an attractive target for aggressive and resistant malignancies that lack effective therapies, we evaluated RNA expression of BIRC5 according to clinical and demographic variables in a large and diverse study population, the Carolina Breast Cancer Study (CBCS; N = 2174 cases: 1113 Black and 1061 non-Black; 1137 < 50 and 1037 ≥ 50 years of age) and compared these findings to those in The Cancer Genome Atlas (TCGA; N = 1095 cases: 183 Black and 816 non-Black; 295 < 50 and 798 ≥ 50 years of age). We hypothesized that in a diverse patient population, BIRC5 would be associated with aggressive disease and recurrence, suggesting potential value in targeted therapy.

Study population
The Carolina Breast Cancer Study (CBCS) [28] is a population-based study that utilized rapid case ascertainment with the North Carolina Central Cancer Registry to identify women aged 20-74 years across 44 counties diagnosed with first primary breast cancer. Recruitment occurred in three phases: 1993-1996 (Phase 1), 1996-2001 (Phase 2), and 2008-2013 (Phase 3). Black women and younger women (< 50 years of age) were oversampled using randomized recruitment [28], such that the final study population is 50% Black and 50% < 50 years old at diagnosis. Out of 4806 invasive breast cancer cases enrolled across all phases, 2174 bulk tumor samples were profiled by Nanostring (Phase 1: N = 259; Phase 2: N = 491; Phase 3: N = 1424). Exclusions included samples with depleted tissue (n = 1188, predominantly from CBCS1/2) or samples with low-quality or insufficient RNA (n = 241). This study was approved by the University of North Carolina at Chapel Hill (UNC-CH) School of Medicine Institutional Review Board in accordance with the revised U.S. Common Rule, and participants provided written informed consent.

Demographic and clinical characteristics
Health history and demographic variables were collected by a nurse during in-home interviews. Race was self-reported and categorized as Black and non-Black; > 94.7% of non-Black participants self-reported as White (n = 1005), while < 5.3% self-identified as either multiracial (n = 9, 0.85%), Hispanic (n = 15, 1.41%), American Indian/Eskimo (n = 8, 0.75%), Asian or Pacific Islander (n = 23, 2.17%) or Arab (n ≤ 5, < 1%). Importantly, we interpret race herein under a cells-to-society framework [29,30] that defines race as a social construct, representing the culmination of biological, social (individual and community-level), and environmental exposures that differ by self-reported race. Tumor size, AJCC stage, estrogen receptor (ER), progesterone receptor (PR), and HER2 receptor status were abstracted from medical records and pathology reports.
Recurrence data were available for CBCS Phase 3 (2008-2013; n = 1424). Recurrence-free survival (RFS) was defined as the time from the date of diagnosis to the first local, regional, or distant breast cancer recurrence, verified through medical record review. Recurrence data are complete through October 2019, with a 5-year follow-up completed for all study participants. Among 1424 eligible women, 50 participants were stage IV at diagnosis and excluded from the recurrence analysis. Among 1374 patients with Stage I-III breast cancer, 159 recurrences were identified within 5 years.
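To make the RFS definition concrete, the following minimal Python sketch derives a time-to-event pair for one participant. It is an illustration only, not the study's code: the function name, argument names, the day-based time scale, and the rule of censoring at the earlier of last follow-up or 5 years are all assumptions, since the text does not spell out the censoring rules.

```python
from datetime import date
from typing import Optional, Tuple

# Assumed length of the 5-year analysis window, in days (1826).
FIVE_YEARS_DAYS = round(5 * 365.25)


def rfs_time_and_event(
    diagnosis: date,
    first_recurrence: Optional[date],
    last_follow_up: date,
) -> Tuple[int, int]:
    """Return (days at risk, event flag) for 5-year recurrence-free survival.

    event = 1 if a first recurrence fell inside the window, else 0 (censored).
    All names here are hypothetical; they are not CBCS variable names.
    """
    if first_recurrence is not None:
        days_to_recurrence = (first_recurrence - diagnosis).days
        if days_to_recurrence <= FIVE_YEARS_DAYS:
            return days_to_recurrence, 1
    # No recurrence inside the window: censor at last follow-up or at 5 years,
    # whichever comes first (an assumed rule).
    days_followed = (last_follow_up - diagnosis).days
    return min(days_followed, FIVE_YEARS_DAYS), 0
```

A recurrence recorded after the 5-year window is treated here as censored at 5 years, which matches the 5-year RFS endpoint but is a design choice of this sketch.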

Gene expression data: normalization, molecular subtyping, and BIRC5
RNA was isolated from bulk tumor tissue using the Qiagen FFPE RNeasy isolation kit (Germantown, MD), assayed using Nanostring nCounter technology (Seattle, Washington), and normalized using Remove Unwanted Variation (RUV) as previously described [31][32][33]. PAM50 molecular subtyping was performed using a research version of the predictor to classify tumors as Luminal A, Luminal B, HER2-Enriched, Basal-like, or Normal-like, and to generate proliferation and risk of recurrence scores (ROR-PT) incorporating tumor size, proliferation and subtype [31,34].
BIRC5 was considered as both a continuous and categorical variable. For continuous measures of BIRC5, log2-transformed gene expression was utilized in all analyses. Standardized clinical cutpoints do not exist for survivin/BIRC5, and while it is a target of both OncotypeDX [18] and Prosigna [19] multi-gene assays, single gene levels are not established. Therefore, for use as a categorical variable, BIRC5 expression was dichotomized into BIRC5-low and BIRC5-high expression categories using the upper limit of the third expression quartile as a cut point (Log2 3rd quartile cutpoint: CBCS = 7.6; TCGA = 9.4). Differences in the expression of BIRC5 between CBCS and TCGA are likely a result of the different mRNA platforms used in each study (i.e., NanoString in CBCS, RNAseq in TCGA). All tumors were treatment naïve at the time of collection and prior to NanoString assay assessing BIRC5 mRNA expression.
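The dichotomization described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name is hypothetical, the quartile interpolation rule (`method="inclusive"`) is an assumption because the text does not say which quantile definition was used, and values exactly equal to the cutpoint are assigned to BIRC5-low.

```python
import statistics
from typing import List, Optional, Sequence


def dichotomize_birc5(
    log2_expr: Sequence[float],
    cutpoint: Optional[float] = None,
) -> List[str]:
    """Label each tumor BIRC5-high (above the third quartile) or BIRC5-low.

    If `cutpoint` is None, the third quartile is estimated from the data;
    otherwise a fixed cutpoint (e.g. a study-specific value) is applied.
    """
    if cutpoint is None:
        # statistics.quantiles with n=4 returns the three quartile cut points.
        _q1, _q2, q3 = statistics.quantiles(log2_expr, n=4, method="inclusive")
        cutpoint = q3
    return ["BIRC5-high" if x > cutpoint else "BIRC5-low" for x in log2_expr]
```

Because the cutpoint is computed within each dataset, platform differences in absolute expression (NanoString versus RNAseq) do not change which tumors fall in the top quartile of their own cohort.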

Statistical analysis
Continuous BIRC5 expression levels were compared across race and clinical tumor characteristics using Welch's two-sample t-tests. Generalized linear models (glm) with binomial distribution and the identity link function were used to calculate relative frequency differences (RFDs) and 95% confidence intervals (CIs) as the measure of association between BIRC5 expression categories and covariates of interest in CBCS. RFDs are defined as the percentage difference between index and referent groups, namely, the relative frequency of BIRC5-high tumors across demographic and clinical variables. With smaller sample sizes, RFDs could not be computed for TCGA because several models failed to converge. Thus, to measure the strength of association between BIRC5-high and covariates of interest in both CBCS and TCGA, multivariate logistic regression was used to calculate odds ratios (ORs) and 95% CIs. Multivariable models were adjusted for age and race according to the CBCS randomized recruitment design in reduced models, and additionally adjusted for ER status and tumor stage in full models. In models comparing age or race, age comparisons were only adjusted for race, and race comparisons were only adjusted for age. Similarly, in models additionally adjusting for ER status and stage, ER comparisons were only adjusted for stage, and stage comparisons were only adjusted for ER status. Multivariable analyses relied on complete case analysis as rates of missingness were < 1.3% for all covariates. Normal-like tumors were excluded from analyses because this subtype arises from insufficient tumor cellularity [31].
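As an illustration of the first comparison named above, the sketch below computes the Welch two-sample t statistic and its Welch-Satterthwaite degrees of freedom using only the Python standard library. The function name is hypothetical, and the p-value is deliberately omitted because the standard library has no Student t distribution; the study itself ran its analyses in R.

```python
import math
import statistics
from typing import Sequence, Tuple


def welch_t(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Return (t statistic, degrees of freedom) for Welch's unequal-variance test."""
    n_x, n_y = len(x), len(y)
    # Squared standard error of each group mean (sample variance / n).
    se2_x = statistics.variance(x) / n_x
    se2_y = statistics.variance(y) / n_y
    t = (statistics.fmean(x) - statistics.fmean(y)) / math.sqrt(se2_x + se2_y)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (se2_x + se2_y) ** 2 / (se2_x ** 2 / (n_x - 1) + se2_y ** 2 / (n_y - 1))
    return t, df
```

Unlike the pooled-variance t-test, this form does not assume equal variances in the two groups, which is why the degrees of freedom are generally not an integer.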
Kaplan-Meier curves and log-rank tests were used to compare mean time to recurrence across BIRC5 categories in stage I-III cases (n = 1374). Recurrence analyses were stratified according to clinical breast cancer subtypes (i.e., ER-positive/HER2-negative and TNBC) and were also performed across all tumor subtypes combined. Hazard ratios (HR) and 95% CI were calculated using crude and multivariate Cox proportional hazard models adjusted for patient age and tumor stage. The Wald p-value was used to assess the assumption of proportionality. While there was evidence of non-proportional hazards, point estimates did not differ substantially between models. All statistical analyses were performed in R version 4.0.3.
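For readers unfamiliar with the Kaplan-Meier (product-limit) estimator behind the recurrence curves, a minimal standard-library Python sketch follows. The function name and input layout are hypothetical; the log-rank test and Cox models are not reproduced here, and the study's own analyses were run in R.

```python
from typing import List, Sequence, Tuple


def kaplan_meier(
    times: Sequence[float],
    events: Sequence[int],
) -> List[Tuple[float, float]]:
    """Return [(event time, estimated survival probability just after it)].

    `events[i]` is 1 for an observed recurrence and 0 for a censored subject.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve: List[Tuple[float, float]] = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_leaving = 0
        # Group all subjects sharing this time; subjects censored at t are
        # still counted as at risk for events occurring at t.
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            survival *= 1.0 - n_events / n_at_risk
            curve.append((t, survival))
        n_at_risk -= n_leaving
    return curve
```

The survival estimate only steps down at times with at least one event; censored subjects reduce the number at risk for later steps without producing a step themselves.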

Data availability
RNA sequencing and clinical data from the TCGA breast cancer dataset, including 1095 primary tumors, were used to compare and validate BIRC5 relationships identified in CBCS. These data are publicly available under dbGaP accession phs000178.v1.p1, with additional data available at https://gdc.cancer.gov/about-data/publications/PanCan-CellOfOrigin [35]. CBCS data are available upon request (https://unclineberger.org/cbcs).

BIRC5 expression, patient and tumor characteristics
The distribution of clinical and demographic characteristics in CBCS reflects its population-based sampling schema, with higher proportions of Black participants, higher proportions of participants < 50 years of age, and higher proportions of ER-negative, Basal-like, and Stage I cases compared to TCGA (Table 1). However, in both TCGA and CBCS, BIRC5-high tumors were more common among Luminal B (LumB), Her2-enriched, Basal-like, ROR-PT-high and ER-negative tumor subtypes, as well as higher-stage tumors, and were more frequent among cases from Black women and younger women (< 50 years of age). BIRC5 is one of the genes used in the PAM50 subtype predictor, so we also performed a sensitivity analysis excluding BIRC5 from the algorithm and found that the distribution of BIRC5-high tumors remained very similar across PAM50 subtypes (Additional file 1: Table S1). Figure 1 shows that in univariate analyses among both CBCS and TCGA, continuous BIRC5 expression differs by race, even after stratification by tumor stage (I, II, III/IV; Fig. 1A) and ER status (positive or negative; Fig. 1B).
We next evaluated associations between categories of BIRC5 expression (i.e., tumors classified as BIRC5-high vs. BIRC5-low, defined as the upper quartile of RNA expression vs. all other quartiles) across the full CBCS and TCGA study populations. In both CBCS and TCGA, similar associations were observed for age at diagnosis, race, ER/PR/HER2 status, PAM50 subtype, tumor stage, and tumor size (Fig. 2, Table 2). To characterize these associations, we estimated relative frequency differences, defined as the difference between the proportions of participants with BIRC5-high tumors in each index group compared to the referent category. In the CBCS, BIRC5-high tumors were 12.1% more frequent among younger participants (< 50 years of age) and 11.7% more frequent among tumors from Black participants. In addition, BIRC5-high displayed strong relationships with aggressive tumor characteristics, with higher frequency among hormone receptor (HR)-negative tumors (BIRC5-high RFD for ER-negative: 27.3%, PR-negative: 21.1%) and aggressive PAM50 subtypes (BIRC5-high RFD for LumB: 33.0%, HER2-Enriched: 28.4%, and Basal-like: 49.8%). After additional adjustment for tumor characteristics (e.g., ER status and tumor stage), BIRC5-high remained significantly associated with young age, Black race, ER status, and tumor size (Fig. 2, left panel; Table 2). We also observed that Stage II tumors had a higher frequency of BIRC5-high (compared to Stage I), although similar associations with Stage III/IV tumors were attenuated after additional adjustment. We performed a sensitivity analysis excluding non-Black participants who did not self-report White race [N = 56 (2.6%)], and the magnitude of associations in Table 2 was unchanged.
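The two effect measures used in this section can be illustrated for the simplest case of one index group versus one referent group with no covariate adjustment. The Python sketch below is an assumption-laden illustration, not the study's models: the function names are hypothetical, the intervals are standard Wald intervals, and the counts in any usage are made up. In this unadjusted two-group setting, the difference in proportions coincides with the coefficient from a binomial model with an identity link, and the cross-product ratio coincides with the logistic regression odds ratio; the adjusted estimates reported in Table 2 require fitted regression models.

```python
import math
from typing import Tuple

# Standard normal quantile for a two-sided 95% interval.
Z_95 = 1.959963984540054


def rfd_with_ci(high_index: int, n_index: int,
                high_ref: int, n_ref: int) -> Tuple[float, float, float]:
    """Crude relative frequency difference (percentage points) with a Wald 95% CI."""
    p1 = high_index / n_index
    p0 = high_ref / n_ref
    diff = p1 - p0
    se = math.sqrt(p1 * (1 - p1) / n_index + p0 * (1 - p0) / n_ref)
    return 100 * diff, 100 * (diff - Z_95 * se), 100 * (diff + Z_95 * se)


def odds_ratio_with_ci(high_index: int, n_index: int,
                       high_ref: int, n_ref: int) -> Tuple[float, float, float]:
    """Crude odds ratio with a Wald 95% CI computed on the log-odds scale."""
    a, b = high_index, n_index - high_index  # index group: high, low
    c, d = high_ref, n_ref - high_ref        # referent group: high, low
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return math.exp(log_or), math.exp(log_or - Z_95 * se), math.exp(log_or + Z_95 * se)
```

For example, with hypothetical counts of 30 BIRC5-high tumors among 100 index participants and 20 among 100 referent participants, the crude RFD is 10 percentage points, with a null value of 0, while the crude OR is about 1.71, with a null value of 1.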
We also present odds ratios from multivariate logistic regression models, which converge better with the smaller cell sizes present in TCGA. Figure 2 displays odds ratios for the association between BIRC5-high,

Discussion
In this analysis, BIRC5/survivin was investigated as a biomarker in two large studies representing 3269 patients with breast cancer, including TCGA and the CBCS, a large and diverse population-based study enriched for Black and younger patients. In both studies, BIRC5 was associated with high-risk populations, including participants with aggressive tumor subtypes, advanced stage and larger tumors. Young women and Black women also had higher frequencies of BIRC5-high tumors. These differences persisted after adjustment for ER status and tumor stage, suggesting that BIRC5 associations are not driven exclusively by subtype and stage and may reflect additional biological, genetic or environmental influences. Higher BIRC5 was also prognostic for early recurrence among ER-positive participants in the CBCS, which is important given that the disparities in recurrence between Black and White women are greatest among ER-positive breast cancer [25,36-40].
The results of our study align with prior work demonstrating an association between high survivin expression and aggressive breast tumor features including hormone receptor negativity, higher stage, larger size, and non-Luminal A subtype [20][21][22][23]40], all of which remained significantly associated, independent of estrogen receptor status. BIRC5/survivin expression has also previously been reported as an independent marker of poor prognosis in breast cancer [24,41]; however, the findings of the current analysis extend those prior investigations to a large and diverse patient population. The prognostic relationship with poorer RFS persists in this study. Given our finding of higher BIRC5/survivin and a previous study showing increased survivin phosphorylation in tumors from Black women [42], the burden of BIRC5-high may be particularly important for Black patients.
Our findings also shed light on previously reported BIRC5 associations with breast cancer clinical outcomes [24,41,43,44], which have seldom been stratified by clinical subtype. While BIRC5-high was more prevalent among TNBC tumors, BIRC5 had the strongest prognostic value among ER-positive/Her2-negative disease. This was also seen in the METABRIC cohort presented by Oparina et al. [40], where BIRC5 was only prognostic in the ER-positive group and not the ER-negative group. In contrast, Zhang et al. [43] showed that survivin predicted survival in 136 TNBC patients. These inconsistent findings across studies highlight that variables mediating BIRC5/survivin responses remain poorly understood. One hypothesis is that in TNBC, a truly distinct disease with its own set of hallmark mutations [35], levels of genomic instability, and underlying tumor immune microenvironment, BIRC5/survivin has a distinct relationship with survival. Elucidating mediating events will be essential to understanding the treatment prospects of anti-survivin therapies. Based on our current results, BIRC5-targeted therapies may be valuable, especially for patients with ER-positive tumors, the subtype with the largest Black-White outcomes disparity [25,36-40].
There is high feasibility of translating anti-survivin therapy to breast cancer, as survivin has already been pursued as a cancer therapeutic target by various strategies [27,45-47] and is already measured in the clinic on the validated prognostic assays Prosigna [19] and Oncotype DX [18]. A strength of our study was the ability to exclude BIRC5 from the PAM50 algorithm (the research version of the Prosigna assay) to independently assess BIRC5/survivin as a high-risk biomarker in breast cancer and its relationship with tumor subtype. Another strength was the use of a large, diverse population-based cohort that represents the natural distribution of breast cancer in the population, and for which RNA expression profiling was optimized for FFPE specimens. However, our analysis also had limitations. One limitation is that, while we observed differences in BIRC5 expression by race, we are unable to evaluate the differential effects of BIRC5 in the context of the social construct of race. Our targeted approach also does not allow for the investigation of survivin splice variants, which have been suggested to differ in function and in prognostic significance [48,49]. Further studies investigating the role of different survivin splice variants in diverse populations may be necessary for therapeutic stratification. Another limitation was the low number of HER2-positive tumors in our dataset, which did not allow for assessment of BIRC5-mediated recurrence among HER2-positive cases. Future studies should also consider longer follow-up times and detailed chemotherapy data to further disentangle the relationship between race, age, tumor subtype and survivin. Our results fill a research gap in understanding the potential role of survivin in breast cancer disparities, and possibly provide future insight into treatment strategies for a cohort of women with unmet clinical needs. Further studies are needed to help close this gap, which constitutes the largest disparity among cancer-specific diseases.

Fig. 1 BIRC5 Expression by Stage and Estrogen Receptor Status in Black and non-Black Patients in CBCS and TCGA. Boxplots displaying continuous log2 BIRC5 mRNA expression among Black and non-Black breast cancer patients in CBCS (upper panels) and TCGA (lower panels) stratified by (A) tumor stage and (B) estrogen receptor status. Welch's two-sample t-test p values are listed within each plot. ER: estrogen receptor

Fig. 3 Five-year recurrence-free survival (RFS) by BIRC5 expression status in CBCS. Kaplan-Meier survival analysis illustrating 5-year RFS in (A) all CBCS phase 3 cases, (B) among ER-positive/Her2-negative tumors only and (C) among triple-negative tumors only. Cox proportional hazard ratios and 95% confidence intervals adjusted for patient age and tumor stage are displayed within each plot for BIRC5-high relative to BIRC5-low tumors. All analyses were restricted to stage I-III tumors. Tick marks represent censored individuals. Shaded regions represent 95% confidence intervals for each group. ER: estrogen receptor; TNBC: triple-negative breast cancer; HR: hazard ratio; 95% CI: 95% Confidence Interval. Referent group = BIRC5-low for CoxPH models.
Foundation (OGUNC1202, OG22873776, SAC210102, TREND21686258) to MAT; and the National Cancer Institute of the National Institutes of Health (P01CA151135) to MAT, including the National Cancer Institute Specialized Program of Research Excellence (SPORE) in Breast Cancer (P50-CA058223), and the US Department of Defense (HT94252310235). In addition, this work was supported by the UNC Center for Health Innovation Pilot Award program to YA, R01CA253450 to MAT, F31CA257388 and L60CA284495 to AMH, and the University of North Carolina at Chapel Hill Cancer Control Education Program (T32CA057726) to ANW and AMH. This research recruited participants and/or obtained data with the assistance of Rapid Case Ascertainment (RCA), a collaboration between the North Carolina Central Cancer Registry and UNC Lineberger. RCA is supported by a grant from the National Cancer Institute of the National Institutes of Health (P30CA016086). The Pathology Services Core is partly supported by the National Cancer Institute of the National Institutes of Health Center Core Support Grant (P30CA016080) and the University of North Carolina at Chapel Hill University Cancer Research Fund.

Table 1
Characteristics of the study population. TCGA: The Cancer Genome Atlas; CBCS: Carolina Breast Cancer Study; LumA: Luminal A; LumB: Luminal B; HER2: Her2-Enriched; Basal: Basal-like; ER: Estrogen Receptor; NA: not available; ROR-PT: PAM50 Risk of Recurrence Score. Missing values not included in percentages

Table 2
Associations between BIRC5-High, clinical, and demographic variables in the Carolina Breast Cancer Study and The Cancer Genome Atlas. TCGA: The Cancer Genome Atlas; CBCS: Carolina Breast Cancer Study; LumA: Luminal A; LumB: Luminal B; ER: Estrogen Receptor; 95% CI: 95% confidence interval; DNC: does not converge; NA: Not available; RFD: relative frequency difference; OR: Odds Ratio; Ref.: Referent. Referent group = BIRC5-Low for all models. Null value for RFD models = 0.0; Null value for OR models = 1.0. a Models adjusted for race and age. b Models adjusted for race, age, tumor stage and ER status